Targeted Myostatin Gene Editing in Multiple Mammalian
Species Directed by a Single Pair of TALE Nucleases

Li Xu, Piming Zhao, Andrew Mariano and Renzhi Han

Myostatin (MSTN) is a negative regulator of skeletal muscle mass. Strategies to block the myostatin signaling pathway have been
extensively pursued to increase muscle mass in various disease settings including muscular dystrophy. Here, we report a 
new class of reagents based on transcription activator-like effector nucleases (TALENs) to disrupt myostatin expression at 
the genome level. We designed a pair of MSTN TALENs to target a highly conserved sequence in the coding region of the 
myostatin gene. We demonstrate that codelivery of these MSTN TALENs induces highly specific and efficient gene disruption
in a variety of human, cattle, and mouse cells. Based upon sequence analysis, this pair of TALENs is expected to be functional 
in many other mammalian species. Moreover, we demonstrate that these MSTN TALENs can facilitate targeted integration of 
a mCherry expression cassette or a larger muscular dystrophy gene (dysferlin) expression cassette into the MSTN locus in 
mouse or human cells. Therefore, targeted editing of the myostatin gene using our highly specific and efficient TALEN pair 
would facilitate cell engineering, allowing potential use in translational research for cell-based therapy. 
Introduction

Myostatin (MSTN) is a transforming growth factor-β family
member that plays a critical role in negatively regulating skeletal
muscle mass.^ Genetic studies have demonstrated that
myostatin gene deficiency leads to muscle hypertrophy due 
to a combination of increased fiber numbers and increased 
fiber sizes in multiple species including human,^ cattle,^-^ 
mouse,^ sheep,^ and dog^ without causing severe adverse 
consequences. Therefore, extensive efforts have been 
undertaken to develop effective strategies for blocking the 
myostatin signaling pathway as therapies for various muscle- 
wasting diseases such as muscular dystrophy, sarcopenia, 
and prolonged bed rest.^-^^ Indeed, myostatin inhibitors
have shown great promise to significantly increase muscle 
growth in model animals. ^'^^"^^ 

Targeting the MSTN gene would provide a permanent 
solution to block myostatin signaling. However, the conventional
gene-targeting approach has been limited to mouse embryonic
stem cells and is not readily adaptable to most other cell
types because of its extremely low targeting frequency.
Recent studies have shown that targeted genome editing 
with minimal toxicity in many different types of cells is pos- 
sible by combining engineered zinc finger nucleases (ZFNs) 
with inherent DNA repair mechanisms within the cell.^^ It has 
been shown that ZFNs promote genome editing via nonho- 
mologous end-joining (NHEJ) and homology-directed DNA 
repair by creating a double-strand break at a specific target 
locus.^^ A typical nuclease is composed of two essential
domains: the DNA-binding domain and the nonspecific cleavage
domain of the FokI restriction enzyme. The DNA-binding
domain, which is composed of multiple zinc finger arrays, can 
be re-engineered to bind to a wide variety of DNA sequences, 
making it possible to engineer ZFNs that specifically target
user-defined sequences. ZFN-facilitated genome editing
allows stable integration of therapeutic genes or restoration
of mutated genes in specific genetic loci.^^ It thus
offers a promising approach for treating genetic disorders 
and has gained much research interest recently. Since the 
first seminal publications about ZFNs in the late 1990s,^^'^°'^^ 
many ZFNs have been successfully engineered to perform 
genome editing in cells of several different species, including
human and mouse. ZFN-mediated in vivo genome editing
was recently shown to restore hemostasis in a mouse model 
of hemophilia via adeno-associated virus-mediated delivery 
of ZFNs and a donor gene into the mouse liver, and ZFN- 
mediated CCR5 gene knockout is currently in clinical trial for 
establishing HIV-1 resistance in CD4+ T cells.^^ These exciting
advances raise the possibility of genome editing as a
viable strategy to treat diseases caused by genetic mutation.
However, there is still no optimal strategy for engineering
highly active and specific ZFNs.

Recently, a new class of nucleases called transcription 
activator-like effector nucleases (TALENs), which contain 
DNA-binding domains based on transcription activator-like 
effector (TALE) proteins from Xanthomonas plant pathogens, 
has emerged.24-2^ The central repeat domain in the TALE
structure mediates DNA binding with each repeat specify- 
ing one target base. The base preference of each repeat is 
determined by two critical, adjacent amino acids referred to 
as the "repeat variable di-residue" (RVD) which preferentially 
recognizes one of the four bases in the target site.^^'^^ This 
simple "two amino acids for one base" code enables rapid 
engineering of customized TALE repeat arrays that recog- 
nize a user-defined target sequence. It has been shown that 
unique TALE-binding sites can be found on average every 35 
base pairs,^^ making it highly attractive for scientific laborato- 
ries to practice gene editing in various cell types. 
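
The "two amino acids for one base" code can be illustrated with a short sketch. The RVD assignments used here (NI for A, HD for C, NG for T, NN for G) are the commonly cited ones and are an assumption of this example; the function name and the use of the left binding site as printed in Figure 4c are illustrative only, and the actual repeat arrays of the MSTN TALENs are those given in the Supplementary Figures.

```python
# Commonly cited RVD preferences (assumed here; NN is used for G, as in the text,
# although it also tolerates A).
BASE_TO_RVD = {"A": "NI", "C": "HD", "G": "NN", "T": "NG"}

def rvd_array(target):
    """Return one RVD per base of a target half-site written 5'->3'."""
    target = target.upper()
    unknown = set(target) - set(BASE_TO_RVD)
    if unknown:
        raise ValueError(f"unsupported bases: {sorted(unknown)}")
    return [BASE_TO_RVD[base] for base in target]

# Left binding site as printed in Figure 4c (illustrative input).
left_site = "GTGCAAATCCTGAGACT"
print(" ".join(rvd_array(left_site)))
```

Because each repeat is chosen independently from a four-entry table, retargeting a TALE array amounts to rewriting this list, which is what makes the design step fast.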

In this study, we report the successful engineering of a
TALEN pair designed to target a highly conserved sequence 
within the coding region of the MSTN gene. High rates of 
MSTN mutations and targeted DNA addition were efficiently 
induced by the TALEN pair in various cell lines of multiple 
species. 

Results 

Design and characterization of MSTN TALEN

To design a working TALEN for editing the human MSTN gene,
we analyzed the sequence within exon 2 using the online
TALEN Targeter program (https://talent.cac.cornell.edu/node/add/talen)2^'2°
and selected a potential target site
(Figure 1a). We assembled the TALEN pair using the Golden
Gate Platform as described previously^^ in two separate plasmids,
each with a WT FokI domain and expression driven
by a CMV promoter. Transfection of either TALEN alone
(GDF8-L or GDF8-R) into HEK293 cells did not result in
any detectable gene-editing activity as measured by the T7E1-directed
mismatch cleavage assay,^^ which detects TALEN
pair-induced NHEJ-mediated small insertions or deletions
(indels) (Figure 1b). However, cotransfection of the TALEN
pair efficiently induced indels, which were detected as two
cleavage bands in the T7E1 assay (Figure 1b). The frequency
of mutated alleles was estimated to be ~19.6% based
on gel densitometry.
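
The text does not state how band intensities were converted into an allele frequency. The sketch below shows one formula that is commonly used for mismatch-cleavage assays, in which the fraction of cleaved product is corrected for random reannealing of mutant and wild-type strands. The function name and the band intensities are hypothetical and are not taken from the gels in this study.

```python
from math import sqrt

def indel_percent(uncut, cut1, cut2):
    """Estimate % mutated alleles from T7E1 band intensities (assumed formula).

    uncut: intensity of the full-length band
    cut1, cut2: intensities of the two cleavage bands
    """
    total = uncut + cut1 + cut2
    if total <= 0:
        raise ValueError("band intensities must sum to a positive value")
    fraction_cleaved = (cut1 + cut2) / total
    # Only heteroduplexes are cut, so the mutant allele fraction is recovered
    # from the uncleaved fraction by taking a square root.
    return 100.0 * (1.0 - sqrt(1.0 - fraction_cleaved))

# Hypothetical intensities: 36% of the signal is in the cleavage bands.
print(round(indel_percent(64.0, 18.0, 18.0), 1))
```

Under this formula the estimate saturates as the mutant fraction grows, which is one reason densitometry-based values tend to understate high editing rates.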



MSTN locus:

          Left TALEN
5' tGTGCAAATCCTGAGACTCATCAAACCCATGAAAGACGGTACAAGGTATACTGGa 3'
3' aCACGTTTAGGACTCTGAGTAGTTTGGGTACTTTCTGCCATGTTCCATATGACCt 5'
                                          Right TALEN




Figure 1 Construction of MSTN TALENs. (a) Schematic showing
the design of MSTN TALENs. A TALEN is composed of a DNA-binding
domain and a nonspecific DNA cleavage domain (FokI). A
pair of TALENs binds to opposite strands of a DNA double helix. The
left and right target sites are located in exon 2 of the human MSTN
locus. (b) T7E1 assay of the gene-editing activity of the left (GDF8-L)
and right (GDF8-R) TALENs with a wild-type FokI domain in human
HEK293 cells. (c) 2A-mediated self-processing of the left and right
TALEN monomers linked by a 2A peptide sequence, harboring the
ELD-Sharkey and KKR-Sharkey variants of the FokI domain, respectively
(GDF8-L/R'). The 110 kDa TALEN monomers were detected with a
FLAG tag antibody. Only one band is visible at 110 kDa because
the two monomers are about the same size. GAPDH was used
as a loading control. (d) T7E1 assay of the gene-editing activity of
GDF8-L/R'. Data shown are representative of at least three
experiments.



Previous studies demonstrated that the WT FokI domain
fused to zinc finger proteins can form homodimers and
thus induce off-target activities and significant cellular toxicity.32-37
To overcome this limitation, others have developed
obligate heterodimer variants of the FokI cleavage domain,
which greatly reduced the cellular toxicity of ZFNs.^^"^^
Therefore, we also assembled the MSTN TALENs using two
well-characterized obligate heterodimeric variants (Q486E,
I499L, N496D in the left TALEN and E490K, I538K and
H537R in the right TALEN)^"^ to reduce homodimer formation.
The "Sharkey" mutations S418P and K441E22'2^ were
also introduced into both heterodimeric FokI domains to further
enhance cleavage activity. The complete sequences of both
TALENs are provided in Supplementary Figure S1 and
Figure S2 online. The two TALENs were linked by a self-cleaving
2A peptide sequence, which allows the transcription
and translation of both TALENs at an equal molar ratio.
Western blotting analysis confirmed faithful expression of
the ~110 kDa TALEN monomers (Figure 1c). The 220 kDa
full-length precursor protein was only faintly visible
at a long exposure time (data not shown). Transfection
of this TALEN-encoding plasmid resulted in a mutation
frequency of ~32.2% (Figure 1d) in human HEK293 cells,
significantly higher than that achieved with two separate
TALEN monomer plasmids carrying the WT FokI domain.
We used this plasmid throughout the following studies.

To examine the dependence of the mutation rate on time
after TALEN treatment, we performed the T7E1 assay on
HEK293 cells transfected with the TALEN plasmid at various
time points post-transfection. Mutation events were
detected at 20 hours after transfection, with the mutation frequency
peaking at 32 hours and remaining stable through
4 days post-transfection (Figure 2a). This trend correlates
well with the typical expression time course in a transient
transfection experiment. To test whether TALEN-mediated
mutations persist during long-term culture, we passaged a
dish of transfected cells for six generations (1 month in total)
and found that the mutations were maintained
(Figure 2b). These data suggest that
TALEN-mediated gene mutations are permanent and
heritable.




Indels (%): (a) Ctrl ND; 20 h 15.0; 32 h 23.5; 48 h 21.7; 72 h 22.1; 96 h 20.5. (b) Ctrl ND; passaged cells 10.0.

Figure 2 MSTN TALENs mediate long-lasting gene editing in
HEK293 cells. (a) HEK293 cells were transfected with the plasmid
encoding MSTN TALENs and analyzed by T7E1 assays at 20, 32,
48, 72, and 96 hours post-transfection. (b) A dish of HEK293 cells
transfected with the plasmid encoding MSTN TALENs was
maintained in culture and passaged for six generations (total 1
month). The cells were then analyzed by T7E1 assay. Nontransfected
HEK293 cells were used as control (Ctrl). ND, not detected. Data
shown are representative of at least three experiments.
Figure 3 Gene-editing activities of MSTN TALENs in various cell lines of different species. Various human, bovine and mouse cell
lines were transfected with the plasmid encoding the left and right TALENs linked by a self-cleaving 2A peptide sequence and analyzed
by T7E1 assay 72 hours post-transfection. (a) HT1080: a human fibrosarcoma cell line; (b) bovine aortic endothelial cells (BAEC); (c) NIH
3T3: a mouse embryonic fibroblast cell line; (d) C2C12: a mouse myoblast cell line. Nontransfected cells were used as control (Ctrl). ND, not
detected. Arrowheads indicate the two cleaved bands, while the arrow indicates the noncleaved band. Data shown are representative of at least
three experiments.



Gene editing in other mammalian species using the 
same MSTN TALEN pair

Since the human target sequence of our MSTN TALENs is
highly conserved among different mammalian species, and
in particular is identical in primates, several livestock
animals and mice (see Supplementary Figure S3 online),
we reasoned that our MSTN TALENs would be functional in cells
of various species. To test this, we transfected the TALEN-expressing
plasmid into several cell lines of different species,
including human HT1080 fibrosarcoma cells, bovine
aortic endothelial cells (BAEC), mouse NIH 3T3 embryonic
fibroblasts, and mouse C2C12 myoblasts. T7E1 assays performed
on all human, bovine, and mouse cells demonstrated
efficient gene editing induced by this pair
of TALENs (Figure 3). The mutation frequency varied from
11.6 to 21.6% among the different cell lines.

The mutation frequency is likely underestimated
because the transfection efficiency in most of the examined
cell lines does not reach 100%. To overcome the problem
of low transfection efficiency, we added an EGFP fusion tag
to the TALEN construct so that we could enrich positively transfected
NIH 3T3 cells by FACS. The T7E1 assay showed that cell
sorting increased the measured gene-editing activity about threefold
(Figure 4a) (43.7% in sorted cells versus 13.7% in nonsorted
cells shown in Figure 3), indicating that transfection efficiency
is a critical factor in measuring gene-editing activity. In addition
to transfection efficiency, other factors, such as the
sensitivity of gel densitometry, can influence the
gene-editing efficiency measured with the T7E1 assay. As an alternative
approach to examine the mutation frequency induced
by MSTN TALENs, we cloned the DNA surrounding the target
sites amplified from these sorted cells. Fifty clones were
randomly picked for direct DNA sequencing. Thirty-eight
of these clones showed various deletions at the cleavage
site (Figure 4b and 4c, and see Supplementary Figure S4a
online), suggesting that our TALEN pair induced approximately
76% gene modification at the target site. Since NIH 3T3 cells
are diploid, we estimated that more than 52% of the cells had
MSTN gene disruption on both alleles (see Supplementary



[Figure 4a and 4b: T7E1 gel and sequencing chromatograms; images not reproduced]

      Left binding site   Cleavage site       Right binding site
WT:   GTGCAAATCCTGAGACT CATCAAACCCATGAAAG ACGGTACAAGGTATACTGG
C1:   GTGCAAATCCTGAGACT CAT AAAG ACGGTACAAGGTATACTGG
C2:   GTGCAAATCCTGAGACT CATC AAG ACGGTACAAGGTATACTGG
C3:   GTGCAAATCCTGAGACT CATCAAACCC-T AG ACGGTACAAGGTATACTGG
C4/5: GTGCAAATCCTGAGACT CATCAA TGAAAG ACGGTACAAGGTATACTGG
C6:   GTGCAAATCCTGAGACT CATCAAACC GAAAG ACGGTACAAGGTATACTGG
C7:   GTGCAAATCCTGAGACT CATC AAAG ACGGTACAAGGTATACTGG
C8:   GTGCAAATCCTGAGACT CAT CATGAAAG ACGGTACAAGGTATACTGG

Figure 4 Sequence analysis of the MSTN locus after MSTN
TALEN-mediated gene editing. (a) Gene-editing efficiency was
measured using the T7E1 assay in sorted NIH 3T3 cells transfected
with the EGFP-tagged MSTN TALEN-encoding plasmid. (b)
Sequencing analysis of the MSTN allele in the sorted NIH 3T3
cells. DNA fragments surrounding the TALEN target sites were
PCR amplified from the sorted NIH 3T3 cells and cloned into a
vector backbone. Fifty clones were randomly picked for direct
DNA sequencing. Sequencing data of four clones (C1, C2, C3 and
WT [C9]) are shown in panel b. (c) Sequence variations of the first
eight clones at the cleavage site were aligned with the WT sequence
(the sequences of the other clones are listed in Supplementary
Figure S4a online). As compared with the WT DNA, the eight clones
had various deletions in the cleavage site. Clones C4 and C5 carry
the same deletion.

Figure S4b online). Among the 38 mutant clones, 24 had
frame-shift deletions or insertions, and the other 14 had in-frame
deletions or mis-sense point mutations. These statistics
are consistent with the randomness of NHEJ DNA repair.
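
The derivation of the "more than 52%" figure is given in Supplementary Figure S4b, which is not reproduced here. One reading that is consistent with the numbers is a counting argument: if a fraction p of all alleles is mutated in a diploid population, at least 2p - 1 of the cells must carry mutations on both alleles. The sketch below computes this lower bound together with the value expected if the two alleles were hit independently; the function name and the independence assumption are illustrative, not the authors' stated method.

```python
def biallelic_bounds(allele_fraction):
    """Return (lower bound, independence estimate) for the fraction of
    diploid cells mutated on both alleles, given the mutated allele fraction."""
    if not 0.0 <= allele_fraction <= 1.0:
        raise ValueError("allele_fraction must lie between 0 and 1")
    # Pigeonhole bound: mutated alleles in excess of one per cell
    # must fall in cells that already carry a mutated allele.
    lower = max(0.0, 2.0 * allele_fraction - 1.0)
    # Assumed model: each allele is mutated independently with probability p.
    independent = allele_fraction ** 2
    return lower, independent

p = 38 / 50  # 38 mutant clones out of 50 sequenced
lower, independent = biallelic_bounds(p)
print(f"lower bound: {lower:.0%}, independence estimate: {independent:.1%}")
```

With p = 0.76 the lower bound is 52%, matching the value quoted in the text, while the independence model gives a somewhat higher estimate.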
High sequence specificity of the MSTN TALENs

In our TALEN engineering, the RVD Asn-Asn (NN), which
recognizes both guanine and adenine,^^ was used for guanine.
One potential problem with the dual recognition by
NN is increased off-target activity. We searched for
potential off-target sites by running the RVDs in the "Paired
Target Finder" tool of TAL Effector Nucleotide
Targeter 2.0 (https://tale-nt.cac.cornell.edu/node/add/talef-off-paired).2^'2°
TALEN-mediated DNA cleavage can occur
wherever two TALEN monomers bind with the proper spacing
and orientation. Thus, a search for off-target sites must
consider all four possible combinations of TALEN monomer
RVD sequences: RVD1+RVD1, RVD1+RVD2, RVD2+RVD1,
and RVD2+RVD2. However, because our TALEN pair used obligate
heterodimer variants of the FokI cleavage domain, the RVD1+RVD1
and RVD2+RVD2 combinations would form a less active dimer
(in the case of the ELD variant) or a nonfunctional dimer. We thus
considered only the RVD1+RVD2 and RVD2+RVD1 combinations. In
total, ten potential off-target sites with these two combinations
were identified in the human genome (see Supplementary
Table S2 online). Manual examination of these sites revealed
that at least four nucleotides within each half site are
not recognized by the RVDs that target the human MSTN
site, even when the dual recognition of NN is taken into
consideration. To directly examine the off-target activity of
our MSTN TALENs, we selected the top three potential off-target
sites, located on three different chromosomes, as listed in
Supplementary Table S2 online, and performed the T7E1 assay
for each of these sites. As shown in Figure 5, no detectable
gene-editing activity was observed at any of these sites.
These data suggest that TALEN-mediated gene editing
is highly sequence specific.
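
The two steps of the manual examination described above, restricting the search to heterodimeric monomer combinations and counting unrecognized bases while allowing NN to read either G or A, can be sketched as follows. This is an illustration only: the RVD preference table uses the commonly cited assignments, the helper names are hypothetical, and the example arrays are toy inputs rather than the sites in Supplementary Table S2.

```python
from itertools import product

# Commonly cited RVD preferences (assumed); NN accepts both G and A.
RVD_BASES = {"NI": {"A"}, "HD": {"C"}, "NG": {"T"}, "NN": {"G", "A"}}

def mismatches(rvds, site):
    """Count positions of a candidate half-site not recognized by the RVD array."""
    if len(rvds) != len(site):
        raise ValueError("RVD array and site must have equal length")
    return sum(base not in RVD_BASES[rvd] for rvd, base in zip(rvds, site.upper()))

def heterodimer_pairs(monomers=("RVD1", "RVD2")):
    """Keep only left/right combinations that pair two different monomers."""
    return [(a, b) for a, b in product(monomers, repeat=2) if a != b]

print(heterodimer_pairs())
print(mismatches(["NN", "NI", "HD", "NG"], "AACT"))  # NN tolerates A at position 1
print(mismatches(["NN", "NI", "HD", "NG"], "CACT"))  # C is not read by NN
```

A candidate site would then be scored per half-site, and under the criterion in the text a site with four or more mismatches in each half would be considered unlikely to be cleaved.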

TALEN-mediated gene editing in primary mouse and 
human myoblasts 

Next, we tested the efficiency of gene editing in primary 
mouse and human myoblasts using our MSTN TALEN pair. 
We established a myoblast culture from flexor digitorum 
brevis (FDB) muscle of a dysferlin-deficient mouse model. 
Primary human myoblasts from control subjects and dysferlinopathy
patients were obtained from the Telethon Network of Genetic
Biobanks. These primary myoblasts exhibited low transfection
efficiency with regular lipid or nonlipid transfection
reagents. To overcome this challenge, we utilized the Neon 
Transfection System (Invitrogen, Carlsbad, CA). The T7E1 
assay confirmed that MSTN TALENs induced high frequency 
(10.3% to 24.6%) of gene editing in all these primary myo- 
blast cultures (Figure 6). 

TALENs-mediated functional disruption of the MSTN 
gene 

To examine the functional outcomes of MSTN TALEN-induced
gene disruption in muscle cells, we screened six
individual clones of C2C12 cells that were sorted for positive
transfection of MSTN TALENs. The genomic DNA of all
six clones was further analyzed by DNA sequencing. Five
of these clones carried biallelic myostatin gene disruption
at the TALEN cleavage site (Figure 7a), while clone #2 was
WT. Several mutations were in-frame deletions or point mutations.
We selected clones #1, 2, and 3 for further analysis. The




Lanes:  Ctrl  +   Ctrl  +   Ctrl  +   Ctrl  +
Indels: ND  32.9%  ND



Figure 5 Off-target activities of the MSTN TALENs. No detectable
gene-editing activities by the MSTN TALENs were observed at three 
top potential off-target sites located on three different chromosomes 
in HEK293 cells as examined by T7E1 assay. ND, not detected. 




Indels: (a) Ctrl ND, TALENs 24.6%; (b) Ctrl ND, TALENs 10.3%; (c) Ctrl ND, TALENs 14.0%

Figure 6 MSTN TALEN-mediated gene editing in primary
human and mouse myoblasts. Gene-editing activity was
observed in three lines of primary myoblasts: (a) mouse myoblasts
derived from FDB muscles, (b) human control myoblasts and (c)
dysferlinopathy patient myoblasts after electroporation-mediated
transfection of MSTN TALENs. ND, not detected.

cells of these three clones were induced to differentiate for
3 days by replacing the growth medium with differentiation
medium. Dexamethasone was added to the medium on
day 3, and the cells were cultured for another 3 days. Dexamethasone
is a glucocorticoid known to induce myostatin expression
and atrophy in C2C12 myotubes.^o Indeed, the WT clone #2
showed dexamethasone-induced atrophy, whereas the other two
MSTN-mutant clones continued to grow larger even in the
presence of dexamethasone (Figure 7b and 7c). Western
blotting analysis demonstrated that myostatin expression
was disrupted in clones #1 and #3 as compared with the
control cells or clone #2 (Figure 7d). These data suggest that
the expression of myostatin is involved in dexamethasone-induced
myotube atrophy.
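
Whether a deletion shifts the reading frame depends on whether its length is a multiple of three. The sketch below applies that rule to the allele sizes labeled in Figure 7a. It is a length-only simplification: it ignores effects such as loss of a splice site or of an essential residue, and the function name and the encoding of a point mutation as 0 are choices made for this example.

```python
def allele_class(deleted_bp):
    """Classify an allele by deletion length alone (0 denotes a point mutation)."""
    if deleted_bp < 0:
        raise ValueError("deleted_bp must be non-negative")
    if deleted_bp == 0:
        return "point mutation"
    return "in-frame" if deleted_bp % 3 == 0 else "frameshift"

# Deletion sizes (bp) per clone as labeled in Figure 7a; clone 2 is WT.
clones = {1: (8, 2), 3: (5, 16), 4: (15, 8), 5: (3, 0), 6: (9, 66)}

for clone, alleles in clones.items():
    print(f"clone {clone}:", [allele_class(n) for n in alleles])
```

By this rule both alleles of clones #1 and #3, the two mutant clones carried forward for functional analysis, are classified as frameshifts.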

TALENs-mediated integration of transgenes into the 
MSTN locus

Double-strand breaks can be repaired by homologous
recombination (HR). Thus, ZFNs and TALENs are valuable
tools for targeted gene addition at a specific
genomic locus. To promote gene insertion at the target site,
we constructed a donor plasmid containing two homology
arms (800 base pairs) that flank the human MSTN
target site, separated by a complete expression cassette for
WT:        GTGCAAATCCTGAGACT CATCAAACCCATGAAAG ACGGTACAAGGTATACTGG
Clone 1
  -8bp:    GTGCAAATCCTGAGACT CCATGAAAG ACGGTACAAGGTATACTGG
  -2bp:    GTGCAAATCCTGAGACT CATCA--CCCATGAAAG ACGGTACAAGGTATACTGG
Clone 3
  -5bp:    GTGCAAATCCTGAGACT CATCA ATGAAAG ACGGTACAAGGTATACTGG
  -16bp:   GTGCAAATCCTGAG AAAG ACGGTACAAGGTATACTGG
Clone 4
  -15bp:   GTGCAAATCCTGAGACT CAT -CGGTACAAGGTATACTGG
  -8bp:    GTGCAAATCCTGAGACT CCATGAAAG ACGGTACAAGGTATACTGG
Clone 5
  -3bp:    GTGCAAATCCTGAGACT CATCAAACC GAAAG ACGGTACAAGGTATACTGG
  point:   GTGCAAATCCTGAGACT CATCAAACCCATGAAAG ACTGTACAAGGTATACTGG
Clone 6
  -9bp:    GTGCAAATCCTGAGACT CAT GAAAG ACGGTACAAGGTATACTGG
  -66bp:   GTGCAAATCCTGAG

[Figure 7b image panels: Clone 1, Clone 2, Clone 3]


Figure 7 Disrupted myostatin protein expression in C2C12 cells
after treatment with MSTN TALENs. (a) Genomic DNA sequencing
results of individual C2C12 clones. (b) Bright-field micrographs of
C2C12 myotubes from the three clones on Day 3 and Day 6. The
cells were treated with 100 nmol/l dexamethasone (Dex) on Days 3
to 6. Scale bar: 100 µm. (c) Quantitative measurement of myotube
diameters. Results are mean ± SEM. **P < 0.01. (d) Western blotting
analysis of myostatin expression in three individual C2C12 clones
after treatment with MSTN TALENs. Arrows indicate the myostatin
bands at ~52, 43 and 28 kDa, respectively. GAPDH was used as a
loading control.

mCherry-puromycin driven by a CMV promoter (Figure 8a).
Cotransfection of MSTN TALENs and the donor plasmid into
HEK293 cells (initial cell number: 5x10^ cells) allowed formation
of mCherry-positive colonies after puromycin selection
(Figure 8b). Specific integration of the expression cassette
was identified by PCR with primers (see Supplementary
Materials and Methods for details) that bind inside
mCherry and inside the MSTN genomic DNA beyond the
left homology arm. Only specific integration events generate
a PCR product. No targeted integration events were
detected in the cell pool that was transfected with the donor
plasmid alone (Figure 8c). However, when MSTN TALENs
were cotransfected with the donor plasmid, the
mCherry-puromycin cassette was efficiently integrated into
the MSTN locus at the target site (Figure 8c). Similarly,
we constructed a mouse version of the donor plasmid and
tested TALEN-mediated integration of the mCherry-puromycin
cassette in mouse C2C12 cells. Again, specific integration
of the expression cassette was detected only in C2C12 cells
cotransfected with MSTN TALENs and the donor plasmid,
but not in cells transfected with the donor only (see Supplementary
Figure S5 online).
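
The logic of the junction PCR, in which one primer binds genomic DNA outside the left homology arm and the other binds inside the cassette, can be checked with a small in-silico sketch. All sequences below are short hypothetical stand-ins (the real arms are 800 bp and the real primers are listed in the Supplementary Materials and Methods), and the helper names are illustrative.

```python
from typing import Optional

def revcomp(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def amplicon_length(template: str, fwd: str, rev: str) -> Optional[int]:
    """Product length if fwd anneals upstream of rev on this molecule, else None."""
    start = template.find(fwd)
    end = template.find(revcomp(rev))
    if start == -1 or end == -1 or end < start:
        return None
    return end + len(rev) - start

# Toy sequences (hypothetical, far shorter than the real homology arms).
upstream, arm_l = "TTGACCGTAA", "GGCATCAAGT"
arm_r, downstream = "CCTAGGATTC", "AATCGGCTTA"
cassette = "ATGGTGAGCAAGGGC"

wild_type = upstream + arm_l + arm_r + downstream
donor = arm_l + cassette + arm_r  # plasmid, or a copy integrated at random
targeted = upstream + arm_l + cassette + arm_r + downstream

fwd = upstream[:8]             # genomic primer beyond the left homology arm
rev = revcomp(cassette[5:13])  # primer inside the cassette, opposite strand

for name, molecule in [("wild type", wild_type), ("donor", donor), ("targeted", targeted)]:
    print(name, amplicon_length(molecule, fwd, rev))
```

Only the targeted allele contains both primer sites on the same molecule, so neither the unmodified locus nor the donor sequence alone gives a product, which is why the assay distinguishes targeted from random integration.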

Next, we examined the feasibility of TALEN-facilitated targeted integration of
a larger gene cassette into the human MSTN locus.
For this purpose, we constructed a donor vector
that carries mCherry-puromycin and enhanced cyan fluorescent
protein (ECFP)-tagged dysferlin, a gene mutated in
limb-girdle muscular dystrophy type 2B and Miyoshi myopathy
patients, linked by a self-cleaving 2A peptide sequence
(Figure 9a). The entire expression cassette flanked by the
left and right homology arms is ~9.3 kb in length. Codelivery
of both MSTN TALENs and the donor vector resulted in
the formation of colonies showing both mCherry and ECFP
fluorescence after puromycin selection (initial cell number:
5x10^) (Figure 9b). Previous studies showed that dysferlin
is primarily localized at the plasma membrane due to
the presence of a single transmembrane domain at the very
C-terminus. Indeed, the ECFP fluorescence showed a typical
plasma membrane pattern (see Supplementary Figure
S6 online), suggesting that full-length dysferlin was expressed.
PCR genotyping confirmed targeted integration in pooled
cells cotransfected with TALENs and donor, whereas no targeted
integration occurred in donor-only cells (see Supplementary
Figure S7 online). We also picked four individual colonies
from each transfection group and performed PCR genotyping
analysis. Two colonies that were positive for CFP-dysferlin in
the cotransfected cells showed the correct PCR product, indicating
targeted integration (Figure 9c), whereas none of the colonies in the
donor-only group yielded the predicted PCR product even though
most of them expressed dysferlin, indicating only random integration
in these cells (Figure 9c).



Discussion 

Gene-modified cells and animals are widely employed in
basic biomedical research and biotechnological applications.
This is often achieved by random gene integration into the
genome either by retroviral transduction, or by plasmid transfection
and selection for stable clones. These strategies can be
labor and cost intensive, requiring clone screening to identify
clones with a suitable expression level. ZFNs and TALENs
are two new classes of engineered enzymes for targeted
genome editing in various cell types. In this study, we successfully
applied TALEN technology to modify MSTN loci
in multiple mammalian cells using a single highly efficient
TALEN pair.

Our MSTN TALEN pair was engineered based upon the GoldyTALEN
scaffold, which allows greater cleavage and genetic
modification efficiency in comparison with others.^^'^^ Indeed,
this TALEN pair induced approximately 10-30% mutation
Figure 8 MSTN TALENs facilitate targeted addition of a marker
gene into the human MSTN locus. (a) A schematic showing
homologous recombination between genomic DNA and the donor
DNA carrying an mCherry-puromycin cassette flanked by the left
and right homology arms (HA-L and HA-R). The arrows indicate
the relative positions of primers used for genotyping analysis. (b)
Cotransfection of the donor plasmid and the plasmid encoding
MSTN TALENs into HEK293 cells resulted in the formation of many
mCherry-positive clones after puromycin selection for a week. Some
clones formed in the cells transfected with only donor plasmid do not
show mCherry fluorescence, indicating random integration events.
Scale bar = 100 µm. (c) Genotyping analysis of the bulk transfected
HEK293 cells with no plasmid, donor only, TALENs only, or donor
plus TALENs, using the primers F1 and R1 or F2 and R2. The cells
were collected 48 hours after transfection.

frequency in the various cell lines and primary myoblast cultures
examined, as measured by T7E1 assay on a regular
TAE agarose gel. The mutation rates so determined are
underestimated because in vitro transfection
efficiency never reaches 100%. To calculate the gene-editing
efficiency more accurately, we fused an EGFP tag to
the TALENs and sorted the transfected cells by FACS
for EGFP. When only EGFP-positive cells were analyzed for
mutation rates induced by the TALENs, the estimated mutation
frequency increased to around 45%. We believe this
number is still an underestimate because additional
factors, such as formation of homoduplexes during the reannealing
process and the limited sensitivity of gel densitometry, can
influence the calculation using the T7E1 assay. As an alternative
approach, we sequenced 50 randomly picked clones of the
TALEN target region and found that 38 of these clones carried
small deletions, insertions, or point mutations. This suggests
that our TALEN pair produced about 76% gene-editing frequency,
consistent with previous reports that high genome-editing
efficiency can be achieved using the GoldyTALEN
scaffold. ^^"^^ Based on this, we estimate that at least 52% of
the cells have mutations in both alleles (see Supplementary
Figure S4 online). This estimate is further supported by
experimental data in C2C12 cells, for which we randomly
picked six clones and found that five of them carried mutations
in both alleles (Figure 7a). Our data suggest that it is feasible
to generate double-allele knockout cells without drug
selection using the TALEN approach.

Our TALEN pair precisely targets exon 2 of the human MSTN
locus. Exon 2 is a naturally occurring mutation site in certain
breeds of cattle with extra-developed muscles.^ Most of the
mutations induced by these TALENs are small deletions that disrupt
the reading frame (Figures 4 and 7c) and are predicted
Figure 9 MSTN TALENs facilitate targeted integration of a
dysferlin expression cassette into the human MSTN locus. (a)
A schematic showing homologous recombination between genomic
DNA and the donor DNA carrying an mCherry-puromycin-2A-ECFP-dysferlin
cassette flanked by the left and right homology arms
(HA-L and HA-R). (b) Cotransfection of the donor plasmid and the
plasmid encoding MSTN TALENs into HEK293 cells resulted in
the formation of many mCherry- and ECFP-positive clones after
puromycin selection for a week. Most clones formed in the cells
transfected with only donor plasmid do not show both mCherry
and ECFP fluorescence, indicating random integration events.
Scale bar = 100 µm. (c) Genotyping analysis of individual colonies
isolated from the HEK293 cells transfected with either donor only
or donor plus TALENs using the following primer combinations: F5
and R1, F4 and R4, or F2 and R2. Nontransfected cells were used
as control (Ctrl).

to completely knock out the expression of myostatin. Indeed,
western blotting and functional analyses demonstrated that
TALEN-mediated MSTN gene disruption led to a dramatic
reduction in the expression of myostatin (Figure 7b and 7c).
Interestingly, even in-frame deletions still disrupted myostatin
expression (data not shown). These data suggest that
targeting exon 2 is a viable approach to disable the function
of myostatin. Our data also suggest that dexamethasone-induced
atrophy is mediated, at least in part, through the expression of myostatin.
Thus, the TALEN-mediated gene disruption approach
can allow rapid gene function analysis in cell culture.

The target sites of our TALEN pair are highly conserved in
mammals (see Supplementary Figure S3 online). In particular,
these sequences are identical in primates and
farm animals such as pig, cattle and horse. Thus, our MSTN
TALENs are predicted to work in all primates and many other
mammals. Rapid generation of MSTN knockout cells and
animals from these species could be achieved using this
single pair of TALENs. Already, TALEN-induced knockout and
knockin farm animals have been developed in pigs and cows
with high efficiency.^"^^

The MSTN TALENs seem to be highly specific. In the human
genome, there are only 10 sequences similar to the target site
(see Supplementary Table S2 online), and we did not detect
any off-target activity at the three most similar sites using
the T7E1 assay (Figure 5). These data suggest that these
TALENs are highly sequence specific. However, it is worth
noting that there are limitations in detecting genome-editing
activities using the T7E1 assay. For example, it is not possible
to detect rare genome-editing events using this assay.
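
As an illustration of how candidate off-target sites can be ranked by similarity to a target site, the following minimal sketch counts base mismatches between equal-length sequences. It is not the procedure used to build Supplementary Table S2: the function names are hypothetical, the sequences are made-up placeholders rather than the MSTN target site, and a realistic TALEN search would also have to consider paired half-sites and spacer length.

```python
def count_mismatches(target: str, candidate: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(target) != len(candidate):
        raise ValueError("sequences must have the same length")
    return sum(a != b for a, b in zip(target.upper(), candidate.upper()))


def rank_candidates(target: str, candidates: dict[str, str]) -> list[tuple[str, int]]:
    """Return (name, mismatches) pairs, most similar candidate first."""
    scored = [(name, count_mismatches(target, seq)) for name, seq in candidates.items()]
    return sorted(scored, key=lambda item: item[1])


# Placeholder sequences for illustration only (NOT real MSTN or genomic sites).
TARGET = "ACGTACGTACGTACGT"
CANDIDATES = {
    "site_A": "ACGTACGTACGTACGA",
    "site_B": "ACGAACGTTCGTACGA",
    "site_C": "TCGTACGTACGTACGA",
}

if __name__ == "__main__":
    for name, mismatches in rank_candidates(TARGET, CANDIDATES):
        print(f"{name}: {mismatches} mismatch(es)")
```

Sites with the fewest mismatches would be the natural first choices for experimental off-target testing.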

The ability of TALENs to stimulate HR by creating a
double-strand break is of interest in the context of targeted gene
addition in cell engineering for cell therapy or other
purposes. Our data suggest that a large DNA sequence of
at least 9.3 kb can be efficiently integrated via HR into the
human MSTN locus. In particular, targeted addition of a
therapeutic gene such as dysferlin, which is defective in a
group of muscular dystrophy patients, into the MSTN locus
of myoblasts would offer significant benefit for a
myoblast-based therapy approach. MSTN is typically expressed
in developing and adult skeletal muscle; addition of the
therapeutic genes for muscular dystrophies into the MSTN
locus would allow persistent expression of such genes in
skeletal muscle. Our experiment carried out on a human
cell line demonstrated that a dysferlin expression cassette
can be efficiently integrated into the human MSTN locus,
allowing persistent and uniform expression of dysferlin. It
would be interesting to see whether this MSTN TALEN pair
can facilitate gene addition into primary human and mouse
myoblasts, which can be used for myoblast transplantation
therapy in the future.

In summary, TALEN is an effective genome-editing tool. 
Application of MSTN TALENs in a variety of cell lines and 
species would allow further investigation of myostatin func- 
tions in mammalian animals that do not have naturally occur- 
ring mutant MSTN models yet. Moreover, this TALEN pair 
would also be a valuable tool for cell engineering in transla- 
tional research such as myoblast transplantation therapy to 
replace defective genes in genetic diseases including mus- 
cular dystrophy. 

Materials and methods 

Assembly of MSTN-TALENs. We constructed a pair of TALENs
to target the MSTN locus using the published Golden Gate
platform.27 The Golden Gate TALEN kit (Kit #1000000016)
was obtained from Addgene. TALENs were assembled in the
same way as described with the following modifications.
Instead of the pTAL scaffold, we used the GoldyTALEN
scaffold to construct our TALENs because the GoldyTALEN
scaffold has an increased gene-editing efficiency. The
GoldyTALEN scaffold was constructed by PCR using the
following primer pairs: TAL-F1,
CAAGGTACCTATGGTGGATCTACGCACGCTCGGCTACAG; TAL-R1,
AGGGTCGACGTCTCCAGGGGAGCACCCGTCAGTGCATTG; TAL-F2,
GACGTCGACCGTCTCCAACGACCACCTCGTC; TAL-R2,
TTGGGATCCGGCAACGCGATGGGACGTGCGTTC. The
two PCR fragments were ligated into the KpnI and BamHI
sites of a mammalian expression plasmid designated as
pTAL5, which is based upon pEGFP-C3 with the following
modifications: (i) a 3xFLAG tag and nuclear localization
signal (NLS) sequence, (ii) a wild-type (WT) FokI domain,
and (iii) BsaI and BsmBI sites removed outside of the
GoldyTALEN scaffold. The final pTAL5 plasmid contains a unique
BsmBI site within the scaffold in order to be compatible with
the Golden Gate platform. To construct TALENs with obligate
heterodimer variants of the FokI domain (ELD-sharkey and
KKR-sharkey), we used the QuikChange II Site-Directed
Mutagenesis kit (Agilent Technologies, Cedar Creek, TX).
To streamline the assembly of a functional pair of TALENs
into one plasmid, we placed two GoldyTALEN scaffolds linked
by a self-cleaving 2A peptide into a final plasmid designated
as pTAL6, which has a unique BsaI site instead of a BsmBI site
within the second GoldyTALEN scaffold. Both pTAL5 and pTAL6
are available upon request. The protein sequences of the MSTN
TALENs are provided in Supplementary Figures S1 and
S2 online.

Construction of mouse and human donor plasmids.
The homology arms (~800 bp) of the human and mouse
MSTN locus were PCR amplified from genomic DNA
of mouse C2C12 cells or human HEK293 cells. The
primer sequences are as follows: hGDF8-HL-F:
5'-ATCTACTAGTGCCTGGCCCTAAAGACAAT-3'; hGDF8-HL-R:
5'-ATCTGGTACCTCTAGATTGTAGGAGTCTCGACGGG-3'; hGDF8-HR-F:
5'-TCTGGTACCGATATCCTCTGAAACTTGACATGAACCC-3'; hGDF8-HR-R:
5'-ATCTGCGGCCGCCCACATCAGTGCATCAACATCC-3'; mGDF8-HL-L:
5'-CGTACTAGTCAAGGCCACTGCTTTCTGAT-3'; mGDF8-HL-R:
5'-AGACTCGAGAAACACTGTTGTAGGAGTCTTGAC-3'; mGDF8-HR-L:
5'-AGCCTCGAGAGCGATATCCGATCTCTGAAACTTGAC-3'; mGDF8-HR-R:
5'-ATCTGCGGCCGCGCAAGTATGCTAAAGGAGTCCA-3'.
PCR products for left and right arms were digested
with SpeI/KpnI and KpnI/NotI, respectively, and ligated into
the SpeI and NotI restriction sites of the AAVS1 SA-2A-puro-pA
donor plasmid (#22075 from Addgene) to generate human and
mouse GDF8 donor plasmids. To add an mCherry-puromycin
expression cassette with a CMV promoter, we assembled the
following pieces together: (i) a CMV promoter was amplified
from pmCherry-C1 plasmid (Clontech, Mountain View, CA)
with primers CMV-F,
5'-AGTGGTCTCCACCGTCGACTAGTTATTAATAGTAATCAATTACGGGGTC-3'
and CMV-R, 5'-ACTCTCGAGAGCGCTAGCGGATCTGACGGTTCACTAAAC-3',
and the PCR product was digested with SpeI and NheI; (ii) the
mCherry fragment was obtained by digesting pmCherry-C1
plasmid (Clontech, Mountain View, CA) with NheI and SalI;
and (iii) the puromycin fragment was obtained from the AAVS1
SA-2A-puro-pA donor plasmid (#22075 from Addgene) by digestion
with XhoI and NotI.
These three fragments were ligated into the SpeI and NotI
sites of the human or mouse GDF8 donor plasmids to create
the final human or mouse GDF8 donor plasmids carrying
a complete mCherry-puromycin expression cassette. Moreover,
an ECFP-dysferlin fragment fused to mCherry-puromycin
with a 2A self-cleaving peptide sequence was added to form
the mCherry-puromycin-2A-ECFP-dysferlin donor plasmids.
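
A quick way to sanity-check cloning primers of this kind is to scan them for the recognition sequences of the enzymes used in the digestion steps. The sketch below is only an illustration, not part of the published protocol: the function name `find_sites` is hypothetical, the primer sequences are copied from the homology-arm primers listed above, and the recognition sequences are the standard ones for SpeI, KpnI, NotI, and XhoI.

```python
# Standard recognition sequences (all palindromic, so one strand suffices).
SITES = {
    "SpeI": "ACTAGT",
    "KpnI": "GGTACC",
    "NotI": "GCGGCCGC",
    "XhoI": "CTCGAG",
}

# Homology-arm primers as printed in the text (5' to 3').
PRIMERS = {
    "hGDF8-HL-F": "ATCTACTAGTGCCTGGCCCTAAAGACAAT",
    "hGDF8-HL-R": "ATCTGGTACCTCTAGATTGTAGGAGTCTCGACGGG",
    "hGDF8-HR-F": "TCTGGTACCGATATCCTCTGAAACTTGACATGAACCC",
    "hGDF8-HR-R": "ATCTGCGGCCGCCCACATCAGTGCATCAACATCC",
    "mGDF8-HL-L": "CGTACTAGTCAAGGCCACTGCTTTCTGAT",
    "mGDF8-HL-R": "AGACTCGAGAAACACTGTTGTAGGAGTCTTGAC",
    "mGDF8-HR-L": "AGCCTCGAGAGCGATATCCGATCTCTGAAACTTGAC",
    "mGDF8-HR-R": "ATCTGCGGCCGCGCAAGTATGCTAAAGGAGTCCA",
}


def find_sites(seq: str, sites: dict[str, str] = SITES) -> dict[str, list[int]]:
    """Map each enzyme to the 0-based start positions of its site in seq."""
    seq = seq.upper()
    hits: dict[str, list[int]] = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, find_sites(seq))
```

Run on the sequences as printed, the scan finds SpeI, KpnI, and NotI sites in the human primers, whereas the inner mouse primers (mGDF8-HL-R and mGDF8-HR-L) carry an XhoI site rather than a KpnI site.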

Cell culture and transfection. HEK293, BAEC, NIH 3T3,
HT1080, and C2C12 cells were cultured in DMEM supplemented
with 10% FBS. Cells were seeded in six-well plates
until they reached ~80% confluence. HEK293 cells were
transfected with 2 µg TALENs-encoding plasmid using
X-tremeGENE HP DNA transfection reagent (Roche,
Indianapolis, IN) and the medium was changed after 24 hours.
Typically, this resulted in about 60% transfection efficiency.
BAEC, NIH 3T3, HT1080, and C2C12 cells were transfected with
7.5 µg TALENs-encoding plasmid using Xfect transfection reagent
(Clontech, Mountain View, CA) and the medium was changed after
4 hours. The cells were assayed 48 hours after transfection.
The transfection efficiency for BAEC varied from 20 to 30%,
and for NIH 3T3, C2C12 and HT1080 cells it varied from 10 to
20% (we thus did three rounds of transfection). Primary mouse
and human myoblasts (#10067 control patient and #9501
dysferlin-deficient patient, obtained from Telethon Genetic
BioBank Network) were cultured in DMEM/F-12 supplemented
with 20% FBS. These primary myoblasts were transfected with the
Neon Transfection System (Invitrogen, Carlsbad, CA). Briefly,
1x10^ cells were electroporated with 0.5 µg TALENs-encoding
plasmid. The electroporation conditions for mouse FDB
myoblasts and human myoblasts were 1700 V, 20 ms, 1 pulse and
1400 V, 20 ms, 2 pulses, respectively. These conditions
consistently resulted in about 50% transfection efficiency in mouse
FDB myoblasts with about 20% survival rate and 30%
transfection efficiency in human myoblasts with 20% survival rate.

T7E1 mismatch-detecting assay. The cleavage activities
of the MSTN TALENs were assayed by mismatch-recognizing
T7E1 as described previously. The T7E1 assay
detects small deletion/insertion mutations (indels)
originating from NHEJ DNA repair events following TALEN-induced
double-strand breaks. Briefly, the cells transfected
with TALENs-expressing plasmid were harvested 3 days
post-transfection and genomic DNA was extracted. A DNA
fragment surrounding the TALEN target site was amplified
by PCR with the AccuPrime PCR kit (Invitrogen, Carlsbad,
CA). The primer pairs were 5'-TGGAGGGGTTTTGTTAATGG-3'
and 5'-TATTGGGTACAGGGCTACCG-3' for human,
5'-AGTGGTCTCACTATACGTACACACTACCCCAACAGC-3'
and 5'-AGTGGTCTCACGCCCATGGGACATGAGATTGACACA-3'
for mouse, and 5'-TCCCGAGGCTCAGTTAGTTGC-3'
and 5'-CACTGGGGTAAGGCACCTTTG-3' for bovine. The
primer pairs used to detect off-target activities were
5'-TCTTATCTGCTGGGCCACTC-3' and 5'-CTGCTCCCGTTTTCTGTAGC-3'
for the human chromosome 5 site,
5'-CACAGGACATGTGGGAACAG-3' and 5'-GCCCAATGGAAAATCGTATG-3'
for the human chromosome 12 site, and
5'-GTTGTGGGACCAAAGACGAT-3' and 5'-ACGCTGGGAATTTCCTCTCT-3'
for the human chromosome 2 site. The DNA fragment was purified
and denatured at 95 °C for 10 minutes, and reannealed slowly
using the following temperature program: 90 cycles of 95 to
59 °C with a 0.4 °C decrease per cycle for 20 seconds, 90
cycles of 59 to 32 °C with a 0.3 °C decrease per cycle for 20
seconds, and 20 cycles of 32 to 26 °C with a 0.3 °C decrease per
cycle for 20 seconds. This allows the formation of DNA
heteroduplexes if NHEJ occurred. The reannealed DNA samples
were incubated with 0.5 µl T7E1 (New England BioLabs, UK)
for 45 minutes and subjected to electrophoresis on a 2% TAE
agarose gel. The gels were stained with ethidium bromide and
imaged using ChemiDoc (Bio-Rad, Hercules, CA). Densitometric
quantification of DNA bands was done using ImageJ. Mutation
frequencies were calculated using the formula: fractional
modification = 1 - (1 - fraction cleaved)^(1/2), as described.
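
The mutation-frequency formula can be sketched in a few lines of code. This is a minimal illustration rather than the authors' analysis script: the function names are hypothetical, the band intensities in the usage example are invented, and "fraction cleaved" is assumed here to be the summed intensity of the cleavage bands divided by the total intensity of cleaved plus uncleaved bands.

```python
import math


def fraction_cleaved(uncleaved: float, cleaved_bands: list[float]) -> float:
    """Assumed definition: cleaved band intensity over total band intensity."""
    cleaved = sum(cleaved_bands)
    total = uncleaved + cleaved
    if total <= 0:
        raise ValueError("total band intensity must be positive")
    return cleaved / total


def fractional_modification(frac_cleaved: float) -> float:
    """Fractional modification = 1 - (1 - fraction cleaved)^(1/2)."""
    if not 0.0 <= frac_cleaved <= 1.0:
        raise ValueError("fraction cleaved must lie between 0 and 1")
    return 1.0 - math.sqrt(1.0 - frac_cleaved)


if __name__ == "__main__":
    # Invented densitometry values for illustration only.
    fc = fraction_cleaved(uncleaved=64.0, cleaved_bands=[20.0, 16.0])
    print(f"fraction cleaved = {fc:.2f}")
    print(f"fractional modification = {fractional_modification(fc):.2f}")
```

The square root accounts for the fact that, after random reannealing, heteroduplexes form between mutant and wild-type strands, so the cleaved fraction overstates the fraction of mutated alleles.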

Fluorescence-activated cell sorting (FACS). Two days after 
transfection with EGFP-tagged MSTN-TALEN plasmid, NIH 
3T3 cells and C2C12 cells were sorted by FACS (FACSAria II, 
BD) to enrich EGFP-positive cells. The sorted EGFP-positive 
cells were further cultured for 1 week and then the genomic 
DNA was extracted. The genomic DNA of sorted cells was 
analyzed for gene-editing activities by T7E1 assay. DNA
fragments surrounding the TALEN target sites were PCR amplified
from the genomic DNA of sorted cells with a forward primer
5'-AGTGGTCTCACTATACGTACACACTACCCCAACAGC-3'
and a reverse primer
5'-AGTGGTCTCACGCCCATGGGACATGAGATTGACACA-3'.
The PCR products were digested
with SnaBI and NcoI restriction enzymes, and subcloned into
a temporary vector based on pFastBac (Invitrogen, Carlsbad,
CA). Ten clones were randomly picked for direct DNA
sequencing.

PCR genotyping of targeted integration. HEK293 cells in
six-well plates were transfected with 1 µg MSTN-TALEN and
1 µg human donor plasmid using X-tremeGENE HP DNA
transfection reagent (Roche, Indianapolis, IN). C2C12 cells
were transfected with 4 µg MSTN-TALEN and 6 µg mouse
donor plasmid using Xfect reagent (Clontech). After 48 hours,
genomic DNA was extracted and targeted integration events 
in cell lines were identified by PCR analysis. The primers 
used are provided in Supplementary Table S1 online. 

Fluorescence microscopy. Fluorescence and bright-field 
images were taken with NIS-Elements Advanced Research 
software package (Nikon, Tokyo, Japan) using an inverted 
Nikon Ti-E microscope equipped with a Xenon lamp (Hama- 
matsu Photonics Systems, Bridgewater, NJ), a 40x 1.30 NA 
objective (Nikon, Tokyo, Japan), and an Evolve 512 EMCCD 
camera (Photometrics, Pleasanton, CA). The EMCCD camera
was cooled to -80 °C during imaging. ECFP-dysferlin
integrated cells were also imaged with a confocal microscope
(TCS-SP5, Leica Microsystems, Wetzlar, Germany), using
the 514 nm line of an argon continuous laser as the excitation
source. Fluorescence emission was collected with a 63x
water immersion objective (HCX PL APO, 1.2 NA).

Western blotting. Cells were lysed with cold RIPA buffer
supplemented with protease inhibitors, and extracted protein
samples were separated by SDS-PAGE and transferred
onto nitrocellulose membranes (0.45 µm). The mouse monoclonal
anti-FLAG (Sigma-Aldrich, Saint Louis, MO), rabbit
polyclonal anti-myostatin (#ab98337, Abcam), and rabbit
monoclonal anti-GAPDH (Cell Signaling) antibodies were
used for immunoblotting analysis. HRP-conjugated rabbit
anti-mouse and goat anti-rabbit secondary antibodies were
obtained from Millipore (Billerica, MA). The membranes were
developed using ECL2 western blotting substrate (Pierce 
Biotechnology, Rockford, IL) and imaged using ChemiDoc 
XRS+ system with Image Lab software (Bio-Rad). 

Measurement of myotube diameter. Myotube cultures were
photographed with a Nikon Ti-E microscope (as mentioned
above) after 3 days and 6 days of differentiation.
Dexamethasone (100 pM) was added to the cultures from day 3
to day 6. The diameters were measured as previously described.
Briefly, a total of 35-62 myotubes in different groups from
at least five random fields were measured using ImageJ
software (NIH, Frederick, MD). The measurements were
conducted in a "blinded" fashion on coded pictures with the
investigator being unaware of the group from which the
cultures originated. Results were expressed as percent of the
diameters on day 3.
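
The normalization to day 3 can be sketched as follows. This is a hedged illustration only: the function name is hypothetical, the diameters in the usage example are invented, and it assumes that group means are compared (the text does not state whether normalization was done per myotube or per group).

```python
from statistics import mean


def percent_of_day3(day3_diameters: list[float], later_diameters: list[float]) -> float:
    """Mean diameter at the later time point as a percentage of the day-3 mean."""
    if not day3_diameters or not later_diameters:
        raise ValueError("both groups need at least one measurement")
    baseline = mean(day3_diameters)
    if baseline <= 0:
        raise ValueError("baseline diameter must be positive")
    return 100.0 * mean(later_diameters) / baseline


if __name__ == "__main__":
    # Invented diameters (arbitrary units) for illustration only.
    day3 = [20.0, 22.0, 18.0]
    day6 = [15.0, 15.0, 15.0]
    print(f"day 6 diameter = {percent_of_day3(day3, day6):.1f}% of day 3")
```

A value below 100% indicates a reduction in mean diameter relative to day 3.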

Supplementary material 

Table S1. Primers used for genotyping analysis of HR inte- 
gration. 

Table S2. List of potential off-target sites of the MSTN 
TALEN pair. 

Figure S1. Amino acid sequences of the left MSTN TALEN.
Figure S2. Amino acid sequences of the right MSTN
TALEN.
Figure S3. Sequence alignment of the MSTN genes from
various species at the TALEN target sites.
Figure S4. Sequence variants in additional clones (a) and
estimation of biallelic genome-editing activities (b).
Figure S5. Genotyping analysis of C2C12 cells transfected
with either donor (mCherry-puromycinR) only, TALENs only,
or donor plus TALENs.
Figure S6. Confocal images of HEK293 cells integrated
with mCherry-puromycinR-2A-ECFP-dysferlin donor.
Figure S7. Genotyping analysis of HEK293 cells transfected
with either donor (mCherry-puromycinR-2A-ECFP-dysfer-